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Bayesian event reconstruction and background rejection in neutrino 
detectors 
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Abstract. Several large volume, high energy neutrino detec- 
tors are in operation or in the design stage. Upward going 
signal neutrino events must be separated from large back- 
grounds of downgoing cosmic ray induced atmospheric mu- 
ons. To this end, a Bayesian extension of the traditional max- 
imum likelihood reconstruction method will be described. 
Further, it will be shown how signals can be separated from 
backgrounds through integration of Bayesian posterior prob- 
ability densities. 



1 Introduction 

Recent years have seen the planning, construction and oper- 
ation of the first large volume, high energy neutrino detec- 
tors. The pioneering work of the DUMAND collaboration 
was followed by the successful deployment and operation of 
the Lake Baikal and AMANDA experiments, and the plan- 
ning and design studies for the ANTARES and Icecube de- 
tectors. These detectors use large arrays of optical modules 
to detect the Cerenkov radiation from upward moving muons 
that result from neutrino interactions that take place in the 
surrounding water/ice or rock. These signals must be sep- 
arated from a very large background of downward moving 
atmospheric muons. At the trigger level, the ratio of down- 
going muons to expected upgoing neutrino-induced muons 
can range from ^ 10^ for a surface detector, down to ~ 10^ 
for a deep underwater/ice detector 

What we will call the "conventional" method of separat- 
ing background and signal is to use a maximum likelihood 
method to fit a muon track arrival direction (based on the 
observed photon arrival times and densities in the optical 
module array) followed by the application of quality cuts to 
reduce the number of mis-reconstructed background events 
(fakes), while keeping a large fraction of signal. Once these 
data are reduced to a manageable size, they are compared to 



Correspondence to: Gary C. Hill 
ghill@alizarin.physics.wisc.edu 



Monte Carlo expectations for both signals and backgrounds, 
and the origins of the data events inferred. Generation of suf- 
ficient background statistics to demonstrate adequate rejec- 
tion power can be problematic, especially for highly "signal- 
like" background events, which are the hardest to reject. 

Ultimately, separation of neutrino events from back- 
grounds is all about hypothesis testing - given an observed 
event, which of a series of possible origins (downgoing mu- 
ons, upgoing neutrinos) is the most likely cause of the event? 
In the conventional method, we assess this by firstly reduc- 
ing the data set through a maximum likelihood reconstruc- 
tion, then by comparing the remaining data to Monte Carlo 
signal and background expectations, using the Monte Carlo 
to predict how many events of a given type are expected from 
both signal and background. At the reconstruction level we 
expect that the event sample is still background dominated. 
After strong quality cuts, we expect the sample to be signal 
dominated. In this paper, 1 describe how Bayesian inference 
can be used to take account of prior knowledge of the ex- 
pected signals and backgrounds at the reconstruction stage, 
leading to much greater initial background rejection than the 
conventional method. Finally, 1 briefly indicate how integra- 
tion of posterior probability densities can be used to directly 
assess the probability that an event is either signal or back- 
ground, providing a single cut parameter signal/background 
separation. 



2 Bayesian approach to event reconstruction 

In this section we first examine the conventional maximum 
likelihood approach to the separation of signal events from 
background events in a neutrino detector, and how the need 
to account for prior probabilities of hypotheses leads natu- 
rally to the Bayesian method. In the maximum likelihood 
method, the likelihood function alone is used to assess the 
origin of an observed event, with the implicit assumption that 
all hypotheses are a priori equally probable. We then propose 
a Bayesian approach to event reconstruction, where the prior 
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knowledge of the probabilities of the hypotheses is combined 
with the likeHhood function in assessing the most Ukely ori- 
gin of an event. 

2 . 1 The maximum likelihood method 

A neutrino detector (Andres et al. 2000) gives us infor- 
mation about the detected muon event in the form of event 
observables such as photon arrival times and numbers of de- 
tected photons per optical module. The UkeUhood function 
£j{E I H) describes the probability that any of the events 
from the total possible event set {E} come from any of the 
possible set of hypotheses {H}. These hypotheses could in- 
clude for example single muons, multiple muon bundles, un- 
corrected muons from separate cosmic ray primary events, 
or upward going neutrino induced muons. As an example, 
if we have a detector that measures photon arrival times with 
only a Gaussian timing error due to the photomultiplier tubes 
(the case for a non-scattering medium), the UkeUhood min- 
imisation reduces to the familiar minimisation. Usually in 
a neutrino detector the assumption is made that the hypothe- 
ses to be tested are single muons. The maximum likelihood 
method is used and the hypothesis space searched until the 
muon track hypothesis with the highest UkeUhood is found. 
In what follows we wiU assume we have a minimisation al- 
gorithm that will always find the global minimum in the hy- 
pothesis space. 

The events are reconstructed, and assigned their best fit 
parameters, one of which is the arrival direction, 9. Since we 
are interested in upgoing muons from neutrinos, a cut is made 
to keep only events with 6 > 90°. What we can infer about 
the origin of these events depends on what type of detector 
we have. There are three scenarios - 

1. The ideal detector Suppose the detector consisted of 
a billion optical modules on a one metre lattice instru- 
menting a cubic kilometre of detector medium. The 
event information would be so abundant and therefore 
the hypotheses constrained so strongly that there would 
be no doubt that the reconstructed direction was in fact 
the true direction of the originating true hypothesis Hf. 
Mathematically, 

L{E \H) H = Ht (1) 
= Hj^Ht (2) 

that is, the likelihood would only be non-zero for the 
true hypothesis. 

Monte Carlo simulations of such a detector would re- 
veal that the chance of any downgoing event misrecon- 
structing into the upward direction would be negUgible. 

2. The non-ideal detector At the other extreme we have 
the detector where Uttle if any constraint is placed on 

possible hypotheses, due to factors such as a sparsely 
instrumented detector volume, or a scattering medium 



where the Cerenkov light cone is smeared out com- 
pletely. All hypotheses would have essentially the same 
likelihood 

ii{E\Hi)^li{E\Hj) y i,j (3) 

and a reconstruction reveals nothing about the origin of 
the event. In this case, the best fit directions for the 
events would be essentially random, and all one could 
really say about a given event is that it was most likely 
to be due to a background event, since these overwhelm 
the signals completely. Again Monte Carlo simulations 
would confirm the inability of such a detector to sepa- 
rate signal and background. 

3. The realistic detector The final case is the realistic de- 
tector. In practise, we try not to construct non-ideal de- 
tectors, but can never afford to build the ideal detector. 
We are left with an in-between situation, where after the 
maximum UkeUhood reconstruction we still have back- 
ground events mis-reconstructed as upgoing events, but 
our Monte Carlo simulations tell us that signal can be 
separated out of the backgrounds by the application of 
quaUty cuts to the events to isolate the best upgoing 
events and reject the falsely reconstructed background 
events. 

2.2 Inferring the origin of reconstructed events in "reaUs- 
tic" detectors 

The conventional likelihood reconstruction method seeks to 
find the hypothesis with the highest UkeUhood to have pro- 
duced the observed pattern of event observables. During the 
search through the hypothesis space, all hypotheses are con- 
sidered as equally probable, and therefore the hypothesis that 
maximises the UkeUhood is taken as the best fit. For instance, 
if the final choice is between an upgoing fit and a downgoing 
fit with only a slightly smaller UkeUhood, the upgoing fit is 
chosen as the reconstructed track. However, if one considers 
that the rate of expected downgoing atmospheric muons is 
zenith dependent, and overall greater by many orders of mag- 
nitude than the expected rate of upgoing neutrino-induced 
muons, then it is not clear that the best choice of the recon- 
structed direction is upgoing. This situation is shown in fig- 
ure 1. The event E has been reconstructed, and a best up- 
going hypothesis Hu, with likelihood £j{E \ Hu), has been 
found. The best downgoing hypothesis is H^, with likeli- 
hood L{E I Hd). The a priori known rates of the hypotheses 
are denoted by the prior probabilities, P{Hu) and P{Hd)- 
Depending on the arrival direction of Hd, P{Hd) can be or- 
ders of magnitude greater than P{Hy). These prior proba- 
bilities must be taken into account in inferring the origin of 
the event. What we really need to know is the probability of 
the hypotheses, given the data, denoted P{H | E). Bayes' 
theorem (Bayes, 1763) tells us that the /7o.yfeWor probability 
P{H I E), is connected to the likelihood through the prior 
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function 



P{H I E) = 



where 



L{E I H)P{H) 
PiE) 



P{E) = j L{E\ H)P{H)dH 



(4) 



(5) 



Since P{E) is a constant (an integral over all possible hy- 
potheses), we can more simply state that the most probable 
hypothesis is the one that maximises the joint probabihty dis- 
tribution L{E I H)P{H), or that 



P[H I E) oc ii{E I H)P{H) 



(6) 




Best downgoing hypothesis: 
Likelihood: L{E \ Hd) 
Prior probability: P{Hd) 



Best upgoing hypothesis: Hu 
Likelihood: l^{E \ H„) 
Prior probability: P(Hu) 



Fig. 1. Event reconstruction in a neutrino detector. "Best" hypothe- 
ses refer to those that maximise the likelihood function £ | i?) 
alone, without consideration of the prior probabilities P{H). 



Bayes' theorem lets us account for the prior probabihties 

of the hypotheses, whereas the conventional reconstruction, 
through an implicit assumption of the uniformity of the hy- 
potheses, does not do this. Even if the hkehhood of the up- 
going hypothesis is the largest, if the prior probability of 
is sufficiently greater than that of Hu, such that 



!i{E I Hd)P{Hd) > L{E I Hu)P{Hu) 



(7) 



then we must reject the event as downgoing. However, if 
after accounting for the prior probabihties, the event appears 
to be upgoing, i.e. 



!i{E I Hu)P{Hu) > L{E I Hd)P{Hd) 



(8) 



there might still be another downgoing hypothesis Ha that 
satisfies 



LiE I Ha)P{Ha) > ^E I Hu)P{Hu) 



(9) 



making Ha the most likely origin of the event, forcing us 
to reject the event as downgoing. Cleary, in order to find 
the most likely origin of the event, we must find the hy- 
pothesis H that maximises the joint probabihty distribution 

!i{E I H)P{H). 



We simply incorporate prior information into the re- 
construction, by maximising iL(_E | H) P{H), instead of 
L{E \ H). A first attempt might be to make P{H) a sim- 
ple function of zenith angle, expressing the expected rates of 
downgoing atmospheric muons and neutrino-induced upgo- 
ing muons. During a search through the hypothesis space, 
downgoing hypotheses would be given more (zenith depen- 
dent) weight, with the result that more events would re- 
construct in the more probable downward direction. This 
Bayesian reconstruction has been used in the analysis of the 
AMANDA BIO data from the austral winter 1997 (Andres 
et al. 2001, De Young 2001, Wiebusch et al. 2001), where 
it has significantly improved the reconstruction level back- 
ground muon rejection, resulting in a simpler analysis, re- 
quiring fewer subsequent quality cuts to separate the neutrino 
signal. 

Having seen the importance of the prior probability P{H) 
in analysing the data in a realistic detector, we can now go 
back and see how Bayesian inference also explains the be- 
haviour of the ideal and non-ideal detectors discussed in sec- 
tion 2. 1, as the two extreme cases of the influence of the prior 
probability. In the case of the ideal detector the hypotheses 
are so strongly constrained that the likelihood is only non- 
zero for the true hypothesis Ht and zero for all other hy- 
potheses 



Si{E \H) ^ H = Ht 
= H^Ht 



(10) 
(11) 



As long as P{Ht) ^ 0, the posterior probability P{H \ E) is 
only non-zero for the true hypothesis, and the detector con- 
strains the hypotheses so strongly that prior knowledge of 
the hypotheses is irrelevant. The complete opposite is the 
case for the non-ideal detector. The detector places no con- 
straint on the hypotheses, the likelihoods of all E given all 
hypotheses are essentially equal 



LiE \ H{) ^ L{E \ H,) V i,3 
and therefore 

P{H I E) oc P{H) 



(12) 



(13) 



Since the detector cannot constrain the hypotheses at all the 
only inference that can be made is based on the prior proba- 
bilities, ie each event is more likely to be a background than 
a signal simply because backgrounds overwhelm signals. 

As expected the realistic detector hes in between the two 
cases discussed so far. Prior probabilities don't matter when 
analysing the ideal detector, but in the non-ideal detector 
prior probabihties tell us all we can know about an event. For 
the realistic detector, both the likelihood and the prior prob- 
ability must be used when inferring the origin of an event. 

Using this Bayesian reconstruction of course still 
only finds the hypothesis with the maximum value of 
L{E I H) P{H). The probability that a class of hypoflie- 
ses (for instance any upgoing hypothesis) was the cause of 
the event is found by integrating the posterior probability for 
that class. In the next section we discuss how integrating 
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these posterior probabilities separately over signal and back- 
ground hypotheses allows us to directly calculate the proba- 
bihty that any event is a signal or a background. 



3 Bayesian separation of signal and backgrounds using 
integrated posterior probabilities 

In this section we briefly discuss the Bayesian method for 
assessing the probability that an event is a signal or a back- 
ground. Elsewhere (Hill, 2001) these ideas are developed in 
greater detail. As our starting point, we restate Bayes' the- 
orem (equation 4) for the posterior probabiUty density for 
hypothesis H as the origin of event E: 



P{H I E) = 



where 



L{E I H)P{H) 



P{E) = j L{E\ H)P{H)dH 



(14) 



(15) 



is the event probabiUty density function (p.d.f.). We split the 

hypothesis space up into separate classes of signal and back- 
ground hypotheses. For example, the background hypotheses 
could be single minimum ionising muons, muons undergo- 
ing catastrophic energy losses (e.g. bremsstrahlung), mutiple 
muon bundles, or overlapping events. The signal hypotheses 
could be atmospheric neutrino induced muons, which travel 
upward through the earth. 

The prior probabilities of these hypotheses are denoted 
P{S) and P{B). The functions | S) and | B) de- 
scribe the likelihood of a given hypothesis to produce a set of 
detector observables (eg photon arrival times, photoelectron 
densities, time-over-thresholds). For completeness, the set of 
events {E} includes every type of event we could observe in 
the detector, right down to events where only detector noise 
contributes, and there is no contribution from, for example, a 
muon. 

Then Bayes' theorem for the posterior p.d.f. of the signal 
hypotheses is 



P{S I E) 



L{E , ,S')P(5') 



P{E) 

and for the background hypotheses 



PiB I E) 



L{E I B)P{B) 



P{E) 

where the event p.d.f. is 



(16) 



(17) 



P{E) = j L{E I S)P{S)dS + j L{E \ B)P{B)dB (18) 

The probabilty Ps that any of the signal hypotheses was the 
cause of E is found by integrating over the posterior p.d.f. 
P{S I E) to give 



Ps = j P{S I E)dS = 



JL{E I S)P{S)dS 
P{E) 



and Ukewise for backgrounds 

I B)P{B)dB 



Pb = j P{B\E)dB 
The ratio 

Ps _ j P{S I E)dS 



P{E) 



mE I S)P{S)dS 



Pb JP{B\E)dB J L{E \ B)P{B)dB 



(20) 



(21) 



gives the "betting odds" in favour of a signal origin over a 
background origin for any event. 
Also since 



Ps+Pb = l 



(22) 



(19) 



either Ps or Pi, gives a complete description of the probabil- 
ity of an event to be either signal or background and therefore 
serves as a single cut parameter 

We could, in principle, calculate the event probability for 
every observed event. In practice, this would be too time 
prohibitive to be of practical use. A better solution would be 
to initially use the Bayesian reconstruction to greatly reduce 
the data set, then calculate the integrated posterior signal and 
background probabihties for this smaU subset of events. 



4 Conclusions 

In this report we have addressed, from a Bayesian perspec- 
tive, the two critical questions faced when analysing neutrino 
telescope data - what is the most hkely arrival direction of an 
event, and what is the probabilty that subsequent upwardly 
reconstructed events are really upgoing signal and not mis- 
reconstructed downgoing atmospheric muons? To answer 
both questions, we have applied Bayes' theorem, which as- 
sesses the plausibility of hypotheses given observed data by 
combining prior knowledge of the hypotheses with the like- 
Uhood function. In the reconstruction case this results in 
more events reconstructing in the more probable downward 
direction. In the case of background rejection, we calculate 
the probability that a given event was upgoing by integrat- 
ing over posterior probabilities for all possible upgoing and 
downgoing hypotheses. 
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